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Abstract — In this paper we investigate, starting with a symmet- 
ric B-DMC, the evolution of various probabilities of the likelihood 
ratios of the synthetic channels created by the recursive appli- 
cation of the basic polarization transformations. The analysis 
provides a new perspective into the theory of channel polarization 
initiated by Arikan and helps us to address a problem related to 
approximating the computations of the likelihood ratios of the 
synthetic channels. 

Index Terms — Channel polarization, polar codes, min-sum 
approximation 

I. Introduction 

Polar coding is a recent technique introduced by Arikan |Q]] 
as an appealing error correction method; this class of codes 
are proved to achieve the symmetric capacity of any binary 
discrete memoryless channel (B-DMC) using low complexity 
encoders and decoders, and their block error probability is 
shown to decrease exponentially in the square root of the block 
length fl. 

The design of polar codes is based on a phenomenon 
called channel polarization. The notion makes reference to 
two extreme situations of communication over a noiseless 
(perfect) channel and completely noisy channel. In HI, Arikan 
describes a recursive process under which independent copies 
of a given B-DMC W : X — >• y can be combined to exhibit 
polarization. The basic building block of this recursion consists 
of two successive channel transformations W~ : X — > y 2 and 
W + : X — ► y 2 x X, whose transition probabilities are defined 
as 
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Referred as the basic polarization transformations, these con- 
stitute the elements of the design leading to the low complexity 
structure of the codes. 

To build the theory of polarization, [ 1 ] considers the prop- 
erties of the above transformations related to the symmetric 
capacities of the channels. Defined as 



x) log 



W(y I x) 



by now it is well known that these transformations Ul 

(i) preserve the sum symmetric capacity: 

I(W~) + I(W + ) = 21 (W), 

(ii) improve the channel in W + and worsen in W~~\ 

I(W~) < I(W) < I(W + ). 

This last property confirms that the evolution is in the right 
direction towards polarization. The idea now is to apply the 
same basic channel transformations to the channels W~ and 
W + . As a result, four channels W , W~ + , W + ~ ', and W ++ 
are obtained. However, one is no longer able to compare in 
general the parameters of these four channels in terms of rate, 
except the knowledge that the channel W ++ is the best one 
and the channel W~~ is the worst one. Instead of worrying 
about ordering the channels after a few steps, the theory 
is founded by analyzing the convergence properties of the 
polarization process obtained by applying the transformations 
to the synthesized ± channels in a long sequence of steps. 

Let (fi, J 7 , P) be a probability space. Assume the random 
sequence Si, ... , B n is drawn i.i.d according to a Bernoulli 
distribution with probabilities equal to ^. Let T n be the 
er-algebra generated by this Bernoulli sequence. Then the 
polarization process for a given channel W is defined |0] as 
the random sequence of channels {W n } such that Wq = W 
and 

r w- if B n = 

W n+ i = 



if B n = 1 
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for n > 0. In the sequel, the random process /„ = I(W n ) is 
defined and |1] proves the process {/„, F n } 

(iii) is a bounded martingale on the interval [0, 1], 

(iv) converges a.s. to a random variable 1^ such that 
E[ioo] = Iq, where 1^ takes values a.s. in {0, 1}. 

These cited two properties prove the recursive application of 
the basic polarization transformations lead to channel polar- 
ization, see (ll Theorem 1]. 

The goal of this paper is to analyze the convergence proper- 
ties of various random processes associated with the channel 
polarization process, as the ones described for the symmetric 
capacity process, but related this time to the likelihood ratios 



of the synthesized ± channels. We first apply this knowledge 
to revisit the theory of channel polarization for symmetric B- 
DMCs. Subsequently, we shift our attention to the performance 
of an approximation to the minus polarization transformation 
known as the min-sum approximation in the coding theory 
literature. We identify a structure sufficient to guarantee no 
performance loss is incurred by an approximation, and we 
argue slight modifications to the 'min-sum' approximation can 
improve the performance. 

The next section explores these results. The final section 
gives the conclusions. 

II. RESULTS 

Let W : X ->• y be a symmetric B-DMC. We define the 
likelihood ratio of this channel as L{y) = W (y\l) /W (y\0) 
for y e y. Similarly for each n > 0, the likelihood ratios of 
the 2 n channels W n : X -> y 2 " x X i_1 , for i = 1, . . . , 2™ 
are denoted as L n (y) for y e y 2 . 

Properties of the polar transforms 

In (jl Equations (74) and (75)] Arikan shows the synthetic 
channels' likelihood ratios follow a recursive structure along- 
side the polarization process. For a symmetric B-DMC, one 
can assume the all zeros sequence is sent through the channel. 
In this case, the corresponding likelihood ratio process can be 
defined as 



where 



£n+i(yiy2) = 



L n (yiy 2 ) = 



L n (yiya), if B n+X = 

i+(yiy 2 ), ifB n +i = l 

£n(yi) + L n (y 2 ) 

1 + Lniy^Lnfa)' 



i„(yiy2) = L n (y ± )L n (y 2 ). 

We denote P [.] = P[.|W n ] for shorthand notation. Let us 
define two auxiliary processes 

P [L n (y) > 1] = P [L n (y) > 1] + ip [L n (y) = 1] 

F [L n {y) < 1] = P [L n (y) < 1] + ip [L n (y) = 1] 

such that P [L n (y) > 1] + P [L n {y) < 1] = 1. 

The following two propositions investigate monotonicity 
properties of the processes P [L n (y) > 1] and P \L n (y) < 1]. 
Their proofs will be carried together. 

Proposition 1: Given that P[L„(y) > 1] < F[L n (y) < 1] 
holds for a particular n > 1, the polar transformations for the 
likelihood ratios satisfy 

F [L+( yi y 2 ) > 1] < P [L n (y) > 1] < P [i^(yiy 2 ) > l] 
F [L-(yiya) < l] < F[L„(y) < 1] < P [i+(yiy a ) < l] 

Proposition 2: Given that P[L„(y) > 1] < F[L n (y) < 1] 
holds for a particular n > 1, the basic polarization transfor- 
mations preserve this inequality, i.e. at the next level we get 

P [L-(yxy 2 ) > 1] < P [£-(yiy 2 ) < l] , 
P [L+(yiy 2 ) >l] <P[i+(yiy 2 )<l]. 



Proof of Propositions Q] and \2\ We first derive some 
useful expressions for the quantities of interest. After applying 
the minus transformation, we get 

P[£-(yiy 2 ) < l] =P[i n ( yi ) < l]P[L n (y a ) < 1] 
+ F[i«(yi) > l]P[£„(y 2 ) > l] 

= P[i„(y) < l] 2 +P[L„(y) > l] 2 , (1) 

P[i-(yiy 2 ) > l] =P[i«(y 1 ) < l]P[i„(y 2 ) > 1] 
+ P[£„(yi) > l]F[in(y 2 ) < 1] 

= 2P[£„(y) < l]P[L„(y) > 1] , (2) 

and 

P [L-(yxy 2 ) - 1] = P [L„(yi) = 1] + P [£„(y 2 ) = 1] 
-P[L n (yi) = l]P[£„(y 2 ) = l] 

= 2P[L„(y) = l]-P[L„(y) = l] 2 . (3) 

Using (f2]i and ©, we obtain similarly 

P[i,T(yiy 2 )>i] 

= P [L~(yiyi) > 1] + ip KT(yiy 2 ) = l] 

= 2P[L n (y)<l]P[L n (y)>l] (4) 
as by few simple manipulations we get 

P[£n(y) < i]P[in(y) > i] 

= P [L„(y) < 1] P [L B (y) > 1] + lp [L n (y) = 1] x 

(P [L B (y) < 1] + P [L„(y) > 1]) +ip [L n (y) = l] 2 
v ' 4 



1-P[i„(y)=l] 

= P [L n (y) < 1] P [L n (y) > 1] + ip [L„(y) = 1] 

-ip[i n (y) = l] 2 . (5) 

Hence, we also have 

P[i"(yiy 2 ) < l] =P[i„(y) < l] 2 +P[L„( y ) > l] 2 . (6) 
Noting the difference of the quantities in (O and equals 

(p [My) $ !] - p [My) ^ i]) 2 > o, (7) 

proves the claim of Proposition|2]for the minus transformation. 

On the other hand, by assumption P [L n (y) < 1] G [0.5, 1] 
holds. So, we have 

P [£n(yiya) > 1] = 2P [L n (y) < 1] P [L„(y) > 1] 

>P[i„(y) > 1], 

which also implies 

P[^(yiy 2 )<i] <P[i„(y)<i], 

proving the inequalities in Proposition Q] for the minus trans- 
formation. 



For the plus transformation, we use a property following 
the symmetry of the channels 

1" 



W{y\Q) = ^^-^[L n (y)=l] = \\ 



L n (y) = 



(8) 



Then, we can write 

P[i+(yiy 2 )<i] 

= E E p \. L M = ^ p = **i 

+ E E p l L n(yi)=ti}PlL n (y 2 )=l 2 } 

ti<i i<e 2 <i/e 1 

+ E E F l L M=h]v[Ln(y 2 )=h] 
ei>n 2 <i/ii 

-ip[L„(y) = l] 2 
=P [L„(y)<l] 2 -ip[L„(y) = l] 2 

+ E E ^ p [ £ n(yi)=4]P[in(y 2 ) = ^] 
+ E E ^ p \ L M = ti] p [£n(ya) = 

^1>1£ 2 >«1 

=P[L n (y)<l] 2 -ip[L„(y) = l] 2 

+ E E ^P[^n(yi)=4]P[in(y2) = ^] 
+ P[L n (y) = l]$^iP[L n (yi)=4] 

+ E E ^ p [ L »(y*) = ^ p t^"^ 2 ) = ^ 

+ P [L„(y) = 1] E ^ P t L «(yi) = 4] + P [Ln(y) = 1] 2 

£ 2 >1 

=f[L n (y) < 1] 2 

+ E E P t L ™(y!) = £l ] P = fc] max{^,i a } 



(9) 



where we abuse the notation to define 



E p ^Myi) = p [^(ya) = max{*i,* 2 } 

£i>l ^ 2 >1 

= E E p [ L «(yi) =^i]P[in(y 2 ) =4]max{4,M 



L>1 £ 2 >1 



+ P [L„ (y) = 1] X) flP [L„ ( yi ) = + ip [L„ (y) = 1] 



In the same spirit, we define 

E p [ L »(yi) =4]P[i„(y 2 ) =4]min{£ 1 ,4} 
= E E p I L «(yi) =4]P[i n (y 2 ) =4]min{4,^} 



^1>1£ 2 >1 



+ P [L n (y) = 1] P [L„(y) > 1] + -P [L n (y) = 1]< 



and we note that 

E E p My*) = ^ p t L «(y 2 ) = ^ 

£i>l £ 2 >1 

(max{4,4} + min{4,^ 2 }) 

= E E p [ L «(y!) = 'i] p [ L «(y 2 ) = ^ ( £i + £2 ) 

£l>l^ 2 >l 

= 2 £ P [i n ( yi ) = 4] £ x ]T P [L„(y 2 ) - £ 2 ] 

£i>l £ 2 >1 

= 2P[L„(y) < l]P[i„(y) > 1] . (10) 

As 

1 = P [L+( yi y 2 ) < 1] + P [L+(yiy 2 ) > l] 
= (P[-My) S 1]+P[£„(y) > i]) 2 

must hold, we get 

P[i+(yiy 2 ) > 1] =P[i„(y)>l] 2 

+ E E p LMyi) =4]P[in(y 2 ) = e 2 }mm{£ 1 ,e 2 } 

e 1 >ie 2 >i 

(11) 

Therefore, ((9) and (fTTl proves that 

P[L+(yiy 2 ) <1] >P[i+(yiy 2 ) > 1] 

holds as claimed by Proposition [2] 

On the other hand, we can decompose P [L n (y) > 1] into 

P[in(y) > l] =P[i n (y) > i] 2 

+ P [L n (y) > 1] P [L n (y) < 1] + ~P [L„(y) = 1] 



= (P[L„(y)>l]--P[L„(y) = i; 



+ P [i„(y) > 1] P [L n (y) < 1] + -P [i„(y) = 1] 

= P [£„(y) > l] 2 + P [L n {y) > 1] P [L n {y) < 1] 

+ ip[L„(y) = l]-ip[L„(y) = l] 2 

= P [L n (y) > i] 2 + P [£„(y) < 1] P [L n (y) > 1] (12) 

where we used the derivation in (0 to get the final equality. 
Comparing the expressions in (fTTl and (TT2l in the light of 
([Tol l, we see that 

P[L+(yiy 3 ) > 1] <P[L„(y) > 1], 

which also implies 

P[L+(yiy 2 ) < 1] >P[L„(y) < 1] . 

proving the claimed inequalities in Proposition [TJ for the plus 
transformation. ■ 
Next, we show the average of the transformed plus and 
minus quantities also satisfy some monotonicity properties. 



Proposition 3: The following set of inequalities hold: 

P [L~(yiy a ) > l] + P [L+( yi y 2 ) > l] > 2P [L n (y) > 1] , 

(13) 

P [^(yiy 2 ) < 1] + P [L+(yiy a ) < 1] < 2P [L„(y) < 1] , 

(14) 

P [L-(yiY2) = i] + p [i+(yiy a ) = l] > 2P [z n (y) = 1] . 

(15) 

Hence, we also have 

P [L-(yiY2) < 1] +P [i+(yiya) < 1] < 2P[£„(y) < 1] , 
P [i„(yiy 2 ) > l] +P [i+(yiy 2 ) > l] > 2P[L„( y ) > l] . 

Proof: We start by proving the inequality in ( [T3l >. Using 
the expressions derived in (@) and (fTTT > show that 

P[£,7(yiy 2 ) >1] +P[ J L+(y 1 y 2 ) >1] 

= 2P [L n (y) > 1] P [L n (y) < 1] +P [L n (y) > l] 2 
1-P[i„(y)$l] 

^i>i t 2 >i 

= 2P[£„(y) > 1] -P[L„(y) > l] 2 
+ HP[^(yi)=^i]P[ L «(y2)=^]min{4,€ 2 } 

^1>1^2>1 

>2P[i„(y) > 1] 

where the inequality follows from 

£l>l f 2 >l 

>P[i«(y) > i] 2 . (16) 

This also proves the inequality in (fl4l in view of the relation 
P[L„+i(yiy 2 ) < 1] = 1 - P[L„+i(yiy 2 ) > 1]. Finally, to 
prove (fT5l l. we write 

P[L-(yiy 2 ) = l] +P[L+( yi y 2 ) = l] 

> 2P [L n (y) = 1] - P [£„(y) = if + P [L n (y) = l] 2 

where we used © and simply noted that P [i+(yiy 2 ) = 1] > 
P[i„(y) = l] 2 holds. ■ 
Before we discuss the implications of the inequalities in 
Proposition [3] on the processes, we define another channel 
parameter as 

Q n =P[i„(y) < 1]-P[i n (y) > 1] 

= P[L„(y) < l]-P[L„(y) > 1]. 

The one step transformations of Q n are given by 
Proposition 4: 

_{Q-, if £„+! = () 

y " +1 -lQ+ ifB n+1 = l 



where 



Proof: From the derivation of ©, we immediately get 
Q~ = Q 2 . Moreover, Proposition Q] implies Q+ > Q n . On 
the other hand, using (0 and (TTTb we have 

Q+=P[L n (y)<l] 2 -P[Z n (y)>l] 2 

+ EE P t L «(yi) = *i] p [ L «(y 2 ) = ^] x 

^1>1^2>1 

(max^^-min^M) 

= Qn + E E p \- L M = ^ p t^ 2 ) = ^] x 



£l>l «2>1 



Moreover, note that 



{max{£ 1 ,i 2 }-min{l 1 J 2 }). 



Qn Qn Q n{\ Qn) 

= (P [i„(y) < i] - p [L n {y) > 1]) 2P [L n {y) > 1] 
= 2P [L n (y) > 1] P [L n (y) < 1] - 2P [L„(y) > l] 2 
as 1 - Q n = 2P [L n {y) > 1]. Therefore, 

2Q„ - Q 2 - Q+ 

= 2P [L„(y) > 1] P [L„(y) < 1] - 2P [L„(y) > l] 2 + Q n 

-Qn-Y, H p [ i »(yi) =4]P[in(y 2 ) =h] x 

fl>l« 2 >l 

(max{^,4}-min{£i,£ 2 }). 
Now, using the expression in (TT~Qb instead of 

2P[£»(y) £ l]P[i«(y) < 1] we get 

2Q„-Q 2 -Q^ = -2P[L„(y)>l] 2 

+ EE P t L «(yi) = ^] p [ L «fr 2 ) = ^] x 

^1>1^2>1 

(max{£i,£ 2 } + niin{4,^2}) 

-EE P t L «(y!) = f i] p [ L «(y 2 ) = £ 2] x 

fl>lf 2 >l 

(max{4^ 2 }-min{^i^ 2 }) 
= 2 ^ P = ^ P t L «(y 2 ) = ^] min{€i, £2} 



1>1^2>1 



2P[L„(y) > l] 2 >0 



Q+ e [Q„,2Q„-Q 2 ] 



where the non-negativity is due to ( TToT l once again. ■ 
Corollary 1: The BEC is an extremal channel in the evo- 
lution of the process Q n . 

Proof: The proof follows by noting that being a BEC 
is preserved under the polarization transformations [1] with 
Q+ = 2Q n - Q 2 . ■ 
Now, we discuss the convergence properties of the processes 
we considered so far. 

Proposition 5: Let If be a symmetric B-DMC such that 
P[i (y) > 1] < P[io(y) < 1] holds. Then, 

(i) The process Q n is a bounded supermartingale in [0, 1] 
and converges a.s. to {0, 1}. 

(ii) The process P (L n (y) > 1) is a bounded submartingale 
in [0,0.5] and converges a.s. to {0,0.5}. 



(iii) The process P (L n (y) = 1) is a bounded submartingale 
in G [0, 1] and converges a.s. to {0, 1}. 

(iv) The process P(L n (y) < 1) is a bounded supermartin- 
gale in G [0.5, 1] and converges a.s. to {0.5, 1}. 

Proof: The assumption on the channel W implies via 
Proposition Q] that P[L„(y)>l] < P[L n (y) < 1] holds 
for all n = 1,2,.... This constraints the probabilities 
to P[£„(y)>l] G [0,0.5], P[L n (y) < 1] G [0.5,1], 
P[L„(y) = 1] G [0,1], from which the boundedness state- 
ments follow. 

The inequalities proved in Proposition[3]shows the processes 
are the claimed martingales. From general results on bounded 
martingales, it follows the processes converge a.s. The only 
part left is to prove the convergence is to the extremes of 
the bounded intervals. For the process Q n , we know by 
Proposition 2] that Q~ — Q\. One can complete the proof that 
Q n converges to the extremes using this relation in a similar 
fashion as in the proof of [ 1 , Proposition 9] of the convergence 
to the extremes of the Bhattacharyya process of the synthetic 
channels associated with the polarization transformations: 



E[|Q„+i-Q„| 



71— »00 

1, 



-> 



=ME[|Q n+1 -Q n \}> -E[Q n (1 - Q n )] > 0, 

Z n— >ao 

whence Qoo G {0,1}. Similarly, we know by (O that 



1) Qoo = 1, PiLoo = 1] = 1: As Qoo = PJL^ < 
1] - P[Loo > 1] = 1 holds, we find P[Loo < 1] = 1, 
contradicting P[Lqo = 1] = 1. So, this case is not 
possible. 

2) Qoo = 1, P[ioo = 1] = 0: We look at a perfect channel. 

3) = 0, PfLoo = 1] = 1: We look at a completely 
noisy channel. 

4) Qoo = 0, PP[Loc = 1] = 0: These constraints 
only tell us P[Loo < 1] = P[Loo > 1] = 0.5 and 
P[ioo = 1] = 0. Hence, we are looking at a 'completely 
moderate' channel. However, Proposition Q] shows that 
the polar transforms are monotone for the probabilities 
of the likelihood ratios. Consequently, this case will 
not occur unless we start with a channel at the state 
P[L < 1] = P[L > 1] = 0.5, but this would violate 
the symmetry condition. 

Note that we still need the preservation of the sum capacities, 
i.e. I(W n ) being a martingale, to show that the fraction of 
perfect channels is I(W). 

Moreover, the results on the rate of convergence of polar 
codes [2] can be stated in terms of Q„: note that the conditions 
(z.l), (z.2), (z.3) in [2] still hold with Z n replaced by Q n , 
and with the condition P \Z X = 0] = Iq in (z.3) replaced by 
P [Qoo = 0] = 1 - J . 



[L n (yiy2) — 1] — 2P [L„(y) — 1] P [L„(y) — 1] holds, Properties of an approximation to the polar transforms 



so that once again P [Loo(y) = 1] G {0, 1} since 

E[|P[L n+1 (yiy 2 ) = 1] -P[J n (y) = 1]|] ► 

n— »oo 

=>E[|P[L n+1 ( yi y 2 ) = 1] -P[J„(y) = 1]|] 

— > 



1, 



> -E[P [L„(y) = 1] (1 - P [L„(y) = 1])] - 

2i It — t LXJ 

Now once Q„ and P [Loo (y) = 1] converge to their extremes, 
the remaining probabilities can only converge to the extremes 
claimed by the proposition. ■ 

Channel Polarization Revisited 

Now, we revisit the theory of channel polarization for 
symmetric B-DMCs. Let us start by describing a perfect 
channel and a completely noisy channel in terms of the channel 
parameters we have discussed so far. It is easy to see that the 
channel is perfect when Q(W) = 1, which is possible only 
when P[Joo < 1] = 1, F[Joo = 1] = P[Joo > 1] = 
hold. Without any surprise, we get I(W) — 1 in this case. 
On the other hand, the channel is completely noisy when 
P[Joo = 1] = 1, P[Joo < 1] = P[Joo > 1] = 0, giving 
Q(W) = and I(W) = 0. 

At this point, we can simply eliminate the other possibil- 
ities as we know I n is a bounded martingale process with 
Joo G {0,1} from UD and capacity cannot be created. These 
are exactly the arguments proving once channels are polarized 
the fraction of moderate channels vanishes. Yet, let us ignore 
this knowledge for a moment to simply look to the four 
possible combinations of the pair Qoo and P [Loo = 1], two 
of which we hopefully 'never' end up with. 



In this section, we discuss the performance of an approxi- 
mation to the minus transformation which appears in [3] and 
0]. The min-sum approximation, as called in the literature, is 
defined as 

logL-( yi y 2 ) - -signal * £ 2 ) min{|4|, \l 2 \} (17) 

where t\ = \ogL n (y 1 ),£ 2 = logL„(y 2 ). While proposed 
in Q3|] for efficient hardware implementations of polar codes, 
|01 considers the performance of mismatched polar codes 
designed using the approximation over binary symmetric chan- 
nels (BSC). 

First, we argue some of the derivations of the previous 
section extend as well to the approximate process defined as 



Jn+i(yiy2) = 



J„(yiy2), ifL„+i=o 
L+{yiy 2 ), if B n+1 = 1 



where 



J„(yiy2) =cxp j -sign (log L„(yi)* log L„(y 2 )) x 
min 1 1 log L„ (y i ) | , | log L„ (y 2 ) 1 1 1 

J^(yiya) = Jn(yi)L„(y 2 ) 

This is explained by the fact that the approximate minus 
transformation of the likelihood ratios satisfy, as the exact case, 



the following properties: 

1) |X+i(yiy2) > 1} 

{£«(yi) > 1} n {i„(y 2 ) < 1} 
(J{i„( yi ) < i}n{Z„(y 2 ) > 1} 

2) {L n+1 ( yi y 2 ) = l}, 

{X n (yi) = i} U{ z «(y2)-i}, 

3) {L n+1 ( yi y 2 ) < l} 

{£«(yi) > 1} n {i„(y 2 ) > 1} 
(J{i„( yi ) < i}n{Z„(y 2 ) < 1}. 

Hence, the below counterparts to (0, (0) and © continue to 
hold. 

1 2 



(yiy2) = 1 
L~(yiy 2 ) > 1 



= 21 



in(y) = i -p i«(y) = i 



(18) 



L n (y) < 1 P L„(y) > 1 



(19) 



(yiya) S ij = p pn(y) < ij + p [£„(y) > 1 

(20) 

Similarly, for the plus transformation as the symmetry in the 
likelihood ratios is preserved by the approximation, one can 
use the LHS of <[8J to derive the below counterparts to ((9) and 
CED: 

P[i+(yiy 2 ) < 1] =P[£«(y) < 1] 2 

+ W(yi|0)W(y a |0)max{L n (yi),L n (y a )} (21) 

yiy2: 
£»(yi)>i 
£»(y 2 )>i 



pp+(yiy 2 )>ij = P[£„(y)<if 

+ ]T W( yi \0)W(y 2 \0)mm{L n ( yi ),L n (y 2 )} (22) 
yiy= : 

i n (y 2 )>i 

As a result, one can carry the proofs of Propositions Q~]and |2] 
in exactly the same way by replacing the uses of ©, (|4), (0, 
®, and £EU by flU), dHJl, <ED, and (f22]i, respectively. 
On the other hand, for a given L(y 2 ) 7^ 1, while the exact 
minus transformation is strictly monotone in L(yi) (increasing 
or decreasing), the approximate one is no longer strictly but 
simply monotone. So, one particular difference caused by the 
minus approximation is identical likelihood ratios obtained for 
some outputs which would otherwise be different from each 
others. Hence, following the approximation a plus transfor- 
mation at the next level will result in more outputs having 



likelihood ratios equal to one. Whether ultimately this would 
cause loss in the performance is an open problem, i.e. we do 
not know if 



Loo(y) < l|ioo(y) = 1=1 



(23) 



is possible. 

A sufficient condition to avoid the above situation from 
happening is the following: 



If 



(24) 



{L n (y) < 1} = {L n (y) < 1}, 
{L n (y) > 1} = {L n (y) > 1} 

{£«+i(yiy 2 ) < 1} = {^n+i(yiy 2 ) < i}, 
{in+i(yiy 2 ) > 1} = {^n+i(yiy 2 ) > 1} 

Consequently, no performance degradation would be incurred 
by such an approximation. The process Q n would behave 
exactly as the process Q n , and the synthetic channels created 
by the approximate transformations would also polarize with 
Qoo 6 {0,1}- 

Now, we discuss how the min-sum approximation can be 
modified to attain this goal. The idea is to slightly perturb 
the identical likelihood ratios forced by the approximation to 
distinct values while keeping the symmetry, and the order 

If L(yi) < L(y 2 ) => L( yi ) < Z(y 2 ), Vyi,y 2 . 

In this case, this new version of the approximation would 
satisfy (|241l . The real trouble might be to find such an approx- 
imation 'better' than the exact case. Still, as we simply want 
to avoid ( f23l , trading-off the order preservation requirement, 
the slight perturbations might still prevent the fraction of 
likelihood ratios of value 1 to dominate the approximate case 
as opposed to the exact case in some of the synthetic channels. 

III. CONCLUSIONS 
In this paper we investigated, starting with a symmetric 
B-DMC, the evolution of various probabilities related to the 
likelihood ratios of the synthetic channels created by the 
recursive application of the polarization transformations. We 
showed the processes are bounded martingales converging to 
the extremes of the bounded intervals using similar proof 
techniques used in [Q]] and the inherent symmetry in the 
channels. The analysis helped us to consider the approximation 
given in Equation (flTT i for the likelihood ratio recursion. 
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